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HEADPHONES WITH INTEGRATED MICROPHONES 

BACKGROUND OF THE INVENTION 
Field Of The Invention 

[0001] The invention relates to a sound reproducing system 
comprising headphones with sound generating means and means for 
5 controlling the sound signal generated by said headphone sound 
generating means to simulate external sound sources. 

[0002] The invention also relates to a headphone for a sound 
reproducing system, 

10 Description Of The Related Art 

[0003] Headphones are used in and for audio equipment, such as 
(mobile) CD-players, but also in call-in centers. 
[0004] The headphones comprise a means for generating sound 
(usually a small loudspeaker) . A recorded sound signal (voice or 
15 music) is sent to the headphone (s) and sound generators inside the 
headphone generate a sound. The listener will, however, perceive 
the generated sound as being generated inside or very near the 
listener's head (which in fact it is), unless the sound signal is 
adapted. Such a sound is perceived to be unnatural. It is known to 
2 0 process the signals such that the perception of the sound signal by 
the listener is such that he/she believes to hear external sound 
sources, i.e., the listener perceives a more natural sound. To 
achieve this, the signals are processed through a filter set to 
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alter the characteristics of the signal such that the sound 
generated near, or within, the head simulates an (or more than one) 
external sound source (s). An important aspect in this respect is 
the transfer characteristics of sound by an external source to the 
5 head and the pinnae of the ear itself, the so-called Head Related 
Transfer Function (HRTF) , i.e., the manner in which sound becomes 
attenuated and altered by the head and pinnea itself before it 
actually is heard. Attempts to process the signals taking into 
account the HRTF to obtain external source simulation, are known 
10 from J. Acoust . Soc . Am. 85(2), pages 858-878, F.L. Wightman and D. 
Kistler, Feb. 1989: 'Headphone simulation of free-field listening I 
and II • . 

[0005] Such attempts, however, do not always prove to be 
successful. The HRTF is dependent on the actual shape and form of 
15 the head and the ear and differs substantially from one person to 
another. Furthermore, head movements complicate matters as they 
also influence the sound perception. 

[0006] Japanese patent application JP 08/079900 A discloses 
providing the headphones with measuring devices to measure the 

2 0 distance between the ears, the height of the head and head 

movements. Although such measurements can be used to improve the 
sound reproduction, the results leave room for improvement. The 
HTRF is a strongly individual one which can only be approximately 
determined using the result of such measurement. Likewise the 

25 effect of head movements can only be approximately determined. 
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SUMMARY OF THE INVENTION 
[0007] It is an object of the invention to provide a sound 
system as described in the opening paragraph with improved sound 
reproduction. 

5 [0008] To this end, the system is characterized in that the 
headphones are provided with microphones, and the means for 
controlling comprises, or is coupled to means for regulating the 
sound production by the headphone sound generating means such that 
a signal registered by the microphones is substantially zero when 

10 at least one external sound source is operative in response to a 
signal, and means for recording the results of said regulating to 
influence external source simulating sound generation in the 
headphones and/or means for regulating the sound production by the 
headphone sound generating means, such that the difference between 

15 a signal registered by the microphones and a known signal is 

substantially zero, and means for recording the results of said 
regulation to influence external sound simulating sound generation 
in the headphones. 

[0009] Each headphone is provided with a microphone. The 
2 0 microphone, which is located near or preferably in the ear, 

registers the sound generated by the headphone sound generating 
means as well as, in one aspect of the invention, by the at least 
one external source. The system comprises means for regulating the 
sound production by the headphone sound generating means such that 
25 the microphone registers a substantially zero signal when. 



S : \GO\SL17GOA0 . GOR 




PHN 17,814 

simultaneously, at least one external source, in response to a 
signal, and the headphone sound generating means are active. The 
headphone then generates a, as far as the human perception is 
concerned, same auditive signal but of opposite sign as the 
5 external source (s). The system includes means for recording the 
results of the regulation. Thereafter, when the external source (s) 
(is) are shut off, or removed altogether, the sound perceived by 
the listener is the same as that for the external sources. The 
signal registered by the microphone will be equivalent to that when 

10 only the source would be operative. The relation between a signal 
sent to the source, such as a loudspeaker, and the signal sent to 
the headphone sound generating means to simulate such an external 
source, is then known. The data from the above-mentioned regulation 
are used for regulation of the sound signal to the headphones in 

15 such manner that the external source is simulated. 

[0010] The relation between a signal sent to an external source; 
a signal sent to the headphone sound generating means and a 
microphone signal are thus measured. Such measurement does, 
however, not only give the relation between signals a (external 

2 0 source signal) and b (equivalent headphone signal) , but also 

between signals b (headphone signal) and c (microphone signal) and 
signals a (external source signal) and c (microphone signal) . These 
known relations can also, or separately, be used in another aspect 
of the invention as follows. 



S : \GO\SL17GOA0 . GOR 



1 

I 

m 



PHN 17, 814 

[0011] Once, for a * standard head' or, in fact, for any head, 
the relations between signals a, b and c have been established, it 
is not, in all circumstances, i.e., for other heads, necessary to 
make further use of an external source with signal a. It suffices 
5 to know (and this is known) the microphone signal c corresponding 
to a particular external source signal a to regulate headphone 
signal b, if needed. When the headphone sound generating means 
'truly' (signal b) simulates an external source (signal a), a 
particular microphone signal (signal c) should be registered. This 

10 is the case on the 'standard head'. However, when the headphone is 
put on another head, the HRTF will be different and the same signal 
b sent to the headphone sound generating means will generate a 
microphone signal c' different from the particular microphone 
signal c because of the different HRTF. The system has means for 

15 regulating the signal b sent to the headphone sound generating 

means (to b') in such manner that signal c' is equal to signal c, 
for recording the regulation data, and for using the regulation 
data for further sound production to simulate external source (s). 
[0012] It should be noted that while, in embodiments, the 

2 0 headphone sound generating means and the microphone will be often 
separate elements, in some embodiments, the headphone sound 
generating means (headphone loudspeakers) may double in function as 
the microphone, especially when such headphone sound generating 
means is placed inside the ear channel. 
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[0013] Preferably, the system also comprises means for storing 
the regulation data for a specific person. 

[0014] This enables regulation data to be kept and coupled to a 
specific user. The next time this user uses the system, an incoming 
5 signal is filtered in the 'right' or at least 'nearly right' 
manner . 

[0015] These and other objects of the invention will be apparent 
from and elucidated with reference to the embodiments described 
hereinafter . 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0016] In the drawings: 

[0017] Fig. 1 illustrates, schematically, how to generate, from 
two real sound sources, a third so-called phantom source; 
15 [0018] Fig. 2 illustrates, schematically, a system in accordance 
with the invention; 

[0019] Fig. 3 illustrates, schematically, a further embodiment 
of a system in accordance with the invention; 

[0020] Fig. 4 illustrates yet a further embodiment of a system 
2 0 in accordance with the invention; 

[0021] Fig. 5 illustrates a still further embodiment of a system 
in accordance with the invention; 

[0022] Fig. 6 illustrates another aspect of the invention; 
[0023] Figs. 7A to 7E illustrate several embodiments of a 
25 headphone for a system in accordance with the invention; and 
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[0024] Fig. 8 illustrates, schematically, how the headphone 

sound generating means may be also the microphone. 

[0025] The figures are schematic and not drawn on scale. 

5 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0026] Figure 1 shows a head of a person 1 with two ears 2 and 
3. Two real loudspeakers LSi (loudspeaker-left) and LSr 
(loudspeaker-right) are present in a room. With these loudspeakers, 
it is meant to generate a sound as if a sound signal is 
10 generated by a loudspeaker LSp at some other point in space. 

[0027] To calculate which signals have to be generated by the 
real loudspeakers LSi and LS^ to give the person 1 the impression 
that the sound he/she hears is generated by a (phantom) sound 
source LSp generating a signal X, the signal X has to be altered, 
15 i.e., filtered by filter function Wxl (1 for left) for loudspeaker 
LSi and by Wxr for loudspeaker LSr. 

[0028] Thus, the signal emitted by loudspeaker LSi is XWxl, and 
the signal generated by LSr is XWxr. 

[0029] A signal generated by a sound source, be it real or 
2 0 phantom, causes (for real sources) , or is supposed to cause (for 
phantom sources) , at an ear, a pressure equivalent to the signal 
multiplied by a transfer function. The transfer function (left 
loudspeaker to left ear) , Wi^ (left loudspeaker to right ear) , W^i 

(right loudspeaker to left ear) , W^^ (right loudspeaker to right 
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ear) , Wpi (phantom loudspeaker to left ear) and Vl^r (phantom 
loudspeaker to right ear) are indicated in the figure. 
[0030] The sound pressure Pi at the left ear caused by 
loudspeakers LSi and LS^ is the sum of the sound pressure XWj 
(signal to left loudspeaker) * Wu (transfer function left 
loudspeaker to left ear) + XWxr (signal to right loudspeaker) * W^i 
(transfer function right loudspeaker to right ear). Thus: 



'XL 



Likewise, the sound pressure at the right ear equals 



15 The sound pressure which would be caused by the phantom loudspeaker 
is : 



Q5i 

^ (left ear) P'l=XWpi 
(right ear) P'r=XWpr 



Substituting Pi=P'i and Pr=P'r, leads to: 



WxL= (WpiWu-Wp.Wri) / (WnW„-WirWri) 



25 WxR= (WpiWir-WprWrJ / (WirWri-WnW„) 
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[0031] 



The filter functions, which, in this simplified model, 



have been described, actually have to be determined for each 
frequency, thus, actually, for each frequency, a filter function Wxr 
and WxL has to be determined, fixed and used. With the proper filter 
5 functions Wxr and Wxl, the listener hears the 'phantom source' LSp, 
Thus, with two loudspeakers, a 'phantom' sound source at a sound 
can be generated which, to the listener, seems to come from another 
location than the actual location of the loudspeakers LSl and LSr. 
This perception is dependent on the accuracy of the transfer 
10 functions (in this application sometimes also called 'filters' or 
•filter settings') Wxl and Wxr. 

[0032] The filters Wxl and Wxr are difficult to determine because 
the transfer functions Wn, Wi^, VI^^ and Wi^ from the loudspeakers LSi 
and LSr to the ear are difficult to determine. The transfer 

15 function for the real loudspeakers, to some extent, can be 

calculated and/or measured for a 'standard head', but, in reality, 
each head and each headphone is different, and thus, a transfer 
function is always more or less appropriate, but never really good. 
The transfer functions for the phantom source can only be estimated 

20 or theoretically derived. Especially for the higher frequencies, 
the transfer functions are difficult to determine because of the 
shape of the head and the ear canal. In short, the Head Related 
Transfer function, HRTF, is a highly individual one. 
[0033] The transfer function needs to be calculated, and the 

25 calculation introduces errors. 
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[0034] For each frequency, the transfer function has to be 
determined, which either requires a large calculation effort and 
such calculation in itself may be a source of error, or 
necessitates the use of average transfer functions for a band of 
5 frequencies, which also introduces errors. 

[0035] All transfer functions are, to some extent, dependent not 
just on the relative positions of the sound sources (real or 
phantom) and the ears, but also on other factors, such as objects 
near the sources or ears which may reflect or alter the sound 
10 waves, and thus, influence the transfer functions. 

[003 6] Thus, there is a need to improve the sound reproduction. 

[0037] Figure 2 illustrates a preferred embodiment of a system 
in accordance with the invention. 

[0038] The system comprises two headphones each of which is 
15 provided with a microphone 6, 7. Each of the headphones has sound 
generating means 4, 5. A signal x(k) is relayed to the means 4, 5 
through filter means (i.e., modulation means) 8, 9 having filter 
setting WxL(k) and WxR(k) . In previous systems, the filters 8, 9 were 
fixed filters (as in Figure 1) and thus, the settings WxL(k) and 
20 WxR(k) were fixed. These fixed filters were usually set to be 

equivalent to an 'average head' in an 'average room'. The signals 
after the filters are indicated with ei(k) and er(k). The signals 
are §1 (k) -x (k) *Wxl (k) and er (k) =x (k) *Wxr (k) . In the system in 
accordance with the invention, microphones 6 and 7 are present in 
25 or near the headphones, and generate signals ri(k) and rr(k). The 
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signals ri(k) and raCk) result from the sum of the sound generated 
by the external source and the headphone. These signals ri(k) and 
rr(k) are fed to respective comparison and regulation means 10 
which also have respective inputs for signal x(k) and respective 
5 outputs to filter means 8, 9 for adapting or regulating settings 
WxL(k) and WxR(k) . It should be noted that in Figure 2, only the 
transfer functions Wn and W^r are shown. This will be explained 
below. 

[0039] A signal x(k) is supplied to the sound source PL and 

10 signals ei(k) and er(k) are supplied to the sound generating means 4 
and 5. The signals ri(k) and r^Ck) are fed to the regulating means 
10. This regulating means influences the settings of the filters 
WxL(k) and WxR(k) (and thereby the signals ei(k)= x(k)*WxL(k) and Srik) 
= x(k)*WxR(k)) until the microphone signals ri(k) and r^ik) (and this 

15 preferably for each, or for a chosen set or selection of 

frequencies) become substantially zero. This may be done by a step- 
wise manner, i.e., one or more parameters (one or more of the 
settings WxL(k) or WxR(k)) is (are) changed, it is then checked 
whether the signal ri(k) is increased or decreased, if it is 

20 increased, the parameter (s) is (are) changed in the opposite sense, 
if it is decreased, the parameter (s) is (are) changed in the same 
sense. This process is repeated until the signals ri(k) and r^ik) 
are substantially zero. For more details of such methods, reference 
is made to, e.g., 'Adaptive Filter Theory' by Simon Haykin, 

25 Prentice Hall, Upper Saddle River, ISBN 0-13-322760-X. It is to be 
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noted that, in general, the less parameters have to be taken into 
account in such methods, the better the result is and the faster 
the result can be achieved. When the microphone signals r^Ck) and 
ri (k) are substantially zero, the listener hears nothing. The 
5 resulting values for filter settings WxL(k) and WxR(k) are thereby 

determined. These filter settings can be, for instance, tables in a 
computer database. When the source PL is shut off or removed, the 
listener will hear a sound which, to the listener, is perceived to 
come from said source PL. Thus, the listener hears a 'phantom 

10 source' at the position of source PL. If the system is to be used 
for one person only, such tables could be the only one to use, but, 
preferably, the system comprises means (schematically indicated by 
input I in Figure 2) for storing established settings WxL(k) and 
WxpCk) for the filters 8, 9, and matches the settings to data 

15 identifying the person. The next time that same person uses the 

system, the filter will then be set correctly, or at least nearly 
correctly, for that person, provided information identifying the 
person is given to the system. In practice, tables are, for 
instance, stored in a computer database matched with a name or 

2 0 number identifying the person. 

[0040] Compared to previous methods and devices, the results are 
better and much more reliable, i.e., a much more 'natural' sounding 
and better 'localized' phantom source is heard by the listener. An 
advantage over fixed filters is that Wxl and Wxr can easily, faster 

2 5 and with much greater accuracy be determined and be adapted for 
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different locations and for different persons. For instance, if 
head transfer functions are calculated with fixed filters, often 
parameters, such as, an average height and width of an average 
head, are used. Such parameters are sometimes useless or may even 
5 give clearly wrong results if the person in question is wearing 
some head ware, such as, a hat or, for instance, has a size head 
substantially different from the average head. Even the height of 
the person's hair may be of importance in this respect. 
Furthermore, more parameters than inter ear distance and head 

10 height may be of importance for the HRTF. The present invention 

does not suffer from these shortcomings but gives reliable results 
for each person, irrespective of the size and shape of the head 
and/or ear and/or whether said person wears a hat, because all of 
these factors do not play a role due to the microphone. 

15 Furthermore, the cross transfer functions (Wj-i and Wu,) are, due to 
the nearness of the source 4, 5 to the ear 2, 3, negligible or, in 
any case, very small. This enables, in preferred embodiments, as, 
e.g., shown in Figure 2, to further greatly simplify the 
calculation, thus removing a source of error. In formula form, it 

20 holds: 



WxL=(Wpi)/(Wn) 



25 
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[0041] 



These formulae are much simplified compared to formulae 



for phantom sound generation using loudspeakers with fixed filters. 
For each ear, the filter functions are only dependent on two, not 
six, transfer functions. In fact, the determinations of the filter 
5 settings Wxr and Wxl are independent. The measurement at the left 
(right) ear suffices to determine WxL(k) (WxR(k)). This enables 
faster (less response time) and much better determination of Wxl and 
Wxr. Furthermore, the response of the acoustic paths of the 
headphones is very short (thus further shortening response time) . 

10 Furthermore, extraneous influences, such as, the shape of a room 
and objects in a room, on the transfer functions Wn, W^r (and W^i, 
Wir) are not present in headphone sound reproduction. As a 
consequence, when tests were done with a system as schematically 
shown in Figure 2 to see what the perceived difference would be 

15 between the real loudspeaker and a phantom loudspeaker, the 
location of the phantom loudspeaker was correct for both an 
anechoic room (a room in which sound reflection is reduced to a 
minimum) and a listening room (a room with normal sound 
reflection) . These results were much better than those for known 

20 systems using fixed filters. As an alternative (and this may be in 
particular of importance for source at a relatively large distance) 
instead of working with a signal coming from each microphone, the 
sum (ri (k) +rr (k) ) and difference (ri (k) -r^ (k) ) of these two signals 
could also be used. If the sum and the difference are zero, both 

25 signals are zero. Usually Wn and W^r are nearly equal (symmetric), 
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and, at large distances from the source Wpi and Wp^, are also not too 
much different. These facts are preferably used to simplify the 
calculations. It should be noted that in Figure 2, the different 
filter means (8, 9) and regulation means (10) are drawn separately 
5 to increase clarity. They may be, and preferably are, all 

integrated in one device. In certain circumstances, for instance, a 
nearly symmetrically arranged fixed position of the source, only 
one microphone may be used. The data of said one microphone would 
then suffice. 

10 [0042] Figure 3 illustrates a further embodiment of a system in 
accordance with the invention. Two loudspeakers PLi and PL2 are 
used. For both loudspeakers, the transfer functions Wxl and Wxr can 
be determined in the manner as described above. This can be done in 
the following manner. First, loudspeaker PLi is activated and 

15 microphone signals are made zero. The filter settings WxL(k) and 

WxR(k) for said loudspeaker are determined. Thereafter, loudspeaker 
PLi is deactivated and loudspeaker PL2 is activated to determine 
filter settings W'xL(k) and W'xR(k) for loudspeaker PL2. The filter 
functions for both loudspeakers having been determined, the system 

2 0 is capable of reproducing any mix of the two sound sources PLi and 
PL2 with a very natural sound, i.e., stereo sound. 
[0043] For a signal x(k) sent to loudspeaker PLi and, 
simultaneously, a signal y(k) sent to loudspeaker PL2, the signals 
to the headphone sound generating means are: 

25 
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ei(k)= x(k)*WxL(k) + y(k)*W'x,(k) and 



e,(k)= x(k)*WxR(k) + Y(k) *Wx,(k) . 



[0044] 



When more than two sources are to be simulated, the 



5 signals to the more than two sources could, for instance, be 
written as a vector and the filter settings for the different 
sources could be written in matrix form. Multiplication of the 
vector (for the sources) with the matrix (for the settings) will 
generate the signals ei(k) and er(k). The matrix itself is 
10 determined by measurements and may be different for different 
persons and different rooms. 

[0045] A further embodiment of the system in accordance with the 
invention is shown in Figure 4. Having established the transfer 
functions Wxl and Wxr, respectively, W*xl and W'xr for two loudspeakers 

15 PLi and PLs, this knowledge can be used to 'create' using, for 

instance, geometrical principles more phantom sound sources, for 
instance, phantom loudspeakers PL3 and PL4 . Using, for instance, 
thereafter, the above technique of vector-matrix multiplication, a 
'surround sound' may be created. The problem with trying to do so 

20 using fixed filters lies, as already explained, among others, in 
the very individual Head Related Transfer Functions and also from 
local circumstances, such as, reverberation in a room. Starting 
from two known sources, one can, using geometry and/or standard 
techniques, calculate the transfer function for the phantom sources 

2 5 PL3 and PL4 in so far as geometry is concerned but not or much less 
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the other influences. In a system in accordance with the invention, 
said difficulty is resolved for the main part, since use is made of 
actual measurements on an actual head with actual headphones (thus, 
taking into account the relevant HRTF) and in an actual room (thus 
5 at least partly taking into account the reverberation in the room) 
resulting in transfer functions which take these influences in 
account giving much better rendition of phantom sources. 
[0046] A yet further embodiment is shown in Figure 5. The 
headphones (or at least one of them, or the connection between the 

10 headphones) comprise means for measuring the position with respect 
to the two sources PLi and PI2 and/or some fixed reference point. 
Such means can be, for instance, infra-red sources which are sensed 
by sensors in or near the sources PLi and PL2 or infra-red sources 
in or near PLi and PL2 which are sensed by sensors in the 

15 headphones. Such means may also comprise means for generating and 
sensing ultra-sound. In this example, the two 'real' loudspeakers 
are positioned at either side of a television set 51. Near or at 
least at one headphone, an emitter of a signal or sensor for 
localization signals is present and a stationary part of the system 

20 comprises a sensor or emitter for localization signals. 

[0047] As explained before, the transfer functions are 
determined using the microphones 6 and 7 and when the two sources 
PLi and PL2 are turned off, they are then audible in the headphones 
as 'phantom sources'. The transfer functions to simulate these two 

2 5 external sources PLi and PL2 then include the individual HRTF and 
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room-related factors. Knowing the position of the head and the 
filter, using geometric considerations, one or more phantom sources 
PL3 and PL4 can be created, or alternatively or in addition, the 
system may comprise tables with many transfer functions for many 
5 different positions of the listener vis-a-vis the sources. As the 
listener moves in the room, the position of the head vis-a-vis the 
sources PLi and PL2 is regularly measured and used to create phantom 
sources PLi to PL4 at the right places. The 'proper' filter 
functions may then be established either by, for instance, choosing 

10 a filter setting table associated with a position most nearest to 
the actual position or taking some average (for instance, by 
interpolation) of several filter settings corresponding to several 
positions close to the actual position. In establishing the 'proper 
filter functions' for real or phantom sources, use may be made of 

15 the fact that human ear is much more perceptible to sound coming 

from positions in front of the head, than to the back of the head. 
In other words, to create a 'surround sound', it is not necessary 
to have a number of sources equally distributed around the 
listener, i.e., the number of sources to the back of the head may 

20 be less. 

[0048] The examples given so far all start with determining 
filter functions Wxl and Wxr for one or more loudspeakers (or 
channels) phantom or real by regulating the signal ei(k), er(k) sent 
to the headphone sound generating means 4, 5 such that the signal 
25 ri(k), rr(k) measured by the microphone (s) is substantially zero 
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when a signal x(k) is sent to a source PLi, PL2 and extracting 
filter setting data WxrOc), WxL(k) from said measurement. 
[0049] Figure 6 illustrates a different aspect of the invention. 
In this particular aspect, an external source has been used to find 
5 the filter settings Wxl and Wxr for a particular head, which, for 

simplicity, will be called a 'standard head'. These filter settings 
are, however, as explained, dependent on the very individual HRTF. 
For other persons, these settings may not be correct. As explained 
above, one way of overcoming this problem is to measure the filter 

10 functions for any individual person and store the filter function 
setting coupled with data identifying said person. However, 
although such procedure gives excellent results, this is a rather 
complicated procedure. In an aspect of the invention, a different 
route is followed. When the filter settings for a 'standard head' 

15 are correct (i.e., the microphone signal due to the sum of the 
sound of an external source and the microphone sound generating 
means due to a signal x(k) is zero), the external source is shut 
off, and a microphone signal r' ' (k) due to signal sent to the 
headphone sound generating means is measured (or alternatively the 

2 0 headphone sound generating means are shut off and the microphone 

signal due to the external source is measured) . Data corresponding 
to the signal r' ' (k) are stored in the system. When another person 
puts on the headphones, the very same signal x(k) will generate 
with the same filter setting the same signal ei(k) sent to the 

25 headphone sound generating means 4, but a microphone signal r' (k) 
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which differs, due to a difference in HRTF, from the stored signal 
r' ' (k) . In Figure 6, it is schematically illustrated that the 
system, in this aspect of the invention, comprises means for 
comparing the signal r' (k) to the signal r' ' (k) and means 10 for 
5 changing the filter settings WxL(k) and WxR(k) (the latter not being 
shown, for simplicity) such that a comparison between a signal 
registered by the microphones (r' (k) ) and a known or calculated 
signal (r' • (k) , r' ' ' (k) , r' ' ' ' (k) ) show said two signals to be 
substantially the same. A comparison of the signals or data 

10 representing the signal r' (k) and r' ' (k) then shows that the 

signals are substantially the same. Such a comparison can be done 
in different ways. The most simplest is to store data for r' ' (k) 
and to calculate the sum or difference (depending on the sign of 
the stored data) of the data for r' (k) and r' ' (k) . These data may 

15 directly represent the signal r' (k) and r' ' (k) or be some data 

derived from the signals, such derivation being done to reduce the 
data needed for comparison. For instance, the signals r' (k) and 
r' ' (k) may be converted into Fourier space and the comparison may 
be done in Fourier space. The filter settings are then recorded 

2 0 (for instance, in means 8, 9 or 10, but they could also be recorded 
in some other means) and they are used for further sound production 
to simulate an or more external source (s). It should be noted that, 
apart from the shape and size of the head, also other factors may 
be of importance, for instance, the acoustics (reverberations, for 

2 5 instance) of the site at which the sound was generated. In Figure 
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6, r' ' (k) may, for instance, correspond to sound reproduction in a 
concert hall, r' • • (k) to sound reproduction in a stadium, and 
r' ' ' ' (k) to sound reproduction in a small room (chamber or club) . 
The user of the system may choose such settings, to his/her liking. 
5 In this example, the comparison signal r'' (k) etc. are fixed 
signals corresponding with fixed situations. In a more 
sophisticated system, the comparison signal could be more freely 
chosen, for instance, by giving the user the opportunity to change 
the size and acoustic characteristics of the virtual site or the 

10 position of the listener within the site. The basic idea is that 
the signal r' (k) (and such for each channel) is compared to a 
stored or computer-generated signal (be it r» ' (k) , r' ' ' (k) , 
r' ' ' ' (k)) and that the two signals are made substantially the same 
by changing the filter settings WxR(k), WxL(k). 

15 [0050] Figures 7A to 7E illustrate several embodiments of a 
headphone for a system in accordance with the invention. 
[0051] In Figure 7A, a tube 12 is attached to the microphone 6 
of headphone 11, the tube 12 being inserted in the inner ear. In 
this embodiment, in which the headphone 11 has a shell -like 

2 0 construction with the sound generating means inside the shell, it 
is preferred that the microphone registers the sound in the inner 
ear near the eardrum. For this purpose, the tubes 12, as sound 
guides, are provided. In Figure 7B, the headphone is placed inside 
the ear and the microphone 6 near or in the inner ear. In Figure 

25 7C, the headphone 11 and microphone 6 are separate devices but both 
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placed in or near the ear. The output signal of the sound 
generating means is fed to a jack 72, the output signal of the 
microphone is fed to a separate jack 71. In Figure 7E, both output 
signals are fed to a single jack 73 which has two separate ports 75 
5 and 76 through which the signals may be transferred to a part of 
the system. This embodiment is the most preferred embodiment, 
because one single jack is necessary. The part of the sound system 
in which the jack will be inserted may be provided with means for 
picking up the signals. Such a jack can be a standard jack, but for 

10 the extra output, likewise, the part of the sound system in which 
the jack will be inserted may be standard, but for the possibility 
of registering the signal from the microphone. This enables 
'standard' equipment, at least as far as the user is concerned, to 
be used. The sound system will be able to operate with 'normal 

15 headphone' (in which case there will be no microphone signal), but 
will be able to register whether or not a headphone in accordance 
with a system of the invention is used, and if so, operate in 
accordance with the invention. 

[0052] Figure 7D illustrates that the signal {ri(k), rr(k) or any 
20 combination of derivative of or data representing said signals) 
from the microphone can be relayed wirelessly as well as by a 
separate plug. 

[0053] It will be clear that within the framework of the 
invention, many variations are possible. 
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[0054] 



For instance, in the above given examples, the microphone 



is shown as an element separate from the other elements. In other 
embodiments, the headphone sound generating means may itself be 
used as microphone. Figure 8 illustrates very schematically how 
this can be done. Headphone sound generating means 81 comprises or 
is coupled to or with a means 82 for driving a membrane 83 to 
generate sounds. Said system is supplied with a signal lin via an 
input 84. The headphone sound generating means also comprises means 
85 (which may have some, most or even all building elements common 
to means 82) with an output 86 which generates a signal lout 
corresponding to the movement of the membrane. A means 87 for 
regulating the signal Im has an input for signal lout and regulates 
lin such that lout becomes substantially zero when an external source 
generates a sound I. In these circumstances, the sound pressure at 
the position of the membrane is zero; thus it is silent. 
Preferably, for these embodiments, i.e., for the embodiments 
wherein the headphone sound generating means double in function as 
microphones, the headphone sound generating means are, in 
operation, located inside the ear. 

[0055] In short; the invention can be described as follows: 
[0056] A sound reproduction system comprises headphones (11) . 
Said headphones comprise means for generating sound (4, 5) and 
microphones (6, 7) (i.e., means for recording sound). Further, the 
system comprises filter means (8, 9) for filtering a signal such 
that the sound produced simulates external sound sources. These 
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filter means comprise filter setting date WxR(k), WXL(k). The system 
comprises a feed-back and control system (10) in which signals 

(ri(k), rr(k)) from the microphones (6, 7) are used to set the 
settings WxL(k), WxR(k) of the filter means (8, 9), The signals can 
be used by making them zero (when an external source is used) 

(ri(k)=0, see Figure 3) or by comparing the microphone signals and 
a gauge signal zero (r' • (k)-r' (k)=0, see Figure 6) such that the 
two are substantially the same. 

[0057] It should be noted that systems are known, for instance, 
for use in very high noise environments, such as airports, to 
cancel noise. In some of such systems, a microphone inside the 
headphone is used. The headphone sound generating means make a 
counter-noise to cut out or at least strongly reduce all noise 
within a certain frequency bandwidth. The idea behind such systems 
is that by eliminating the usually low frequency noise, the noise 
to signal ratio between the noise and the usually more high 
frequency communication sounds signals is increased. Such systems, 
however, do not simulate external sources nor are the microphone 
signals used to set filter settings. 
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